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ABSTRACT 

The emergence of Middle East respiratory syndrome showed once again that 
coronaviruses (CoVs) in animals are potential source for epidemics in humans. To 
explore the diversity of deltacoronaviruses in animals in the Middle East, we tested 
fecal samples from 1,356 mammals and birds in Dubai. Four novel deltacoronaviruses 
were detected from eight birds of four species by RT-PCR: FalCoV UAE-HKU27 
from a falcon, HouCoV UAE-HKU28 from а houbara bustard, PICoV UAE-HKU29 
from a pigeon and QuaCoV UAE-HKU30 from five quails. Complete genome 
sequencing showed that FalCoV UAE-HKU27, HouCoV UAE-HKU28 апа PiCoV 
UAE-HKU29 belong to the same CoV species, suggesting recent interspecies 
transmission between falcons and their preys, houbara bustards and pigeons possibly 
along the food chain. Western blot detected specific anti-FalCoV UAE-HKU27 
antibodies in 33 (75%) of 44 falcon serum samples, supporting genuine infection in 
falcons after virus acquisition. QuaCoV UAE-HKU30 belongs to the same CoV 
species as PorCoV HKU15 and SpCoV HKUI7 discovered previously from swine 
and tree sparrows respectively, supporting avian-to-swine transmission. 
Recombination involving the spike protein is common among deltacoronaviruses, 
which may facilitate cross-species transmission. FalCoV UAE-HKU27, HouCoV 
UAE-HKU28 and PiCoV UAE-HKU29 were originated from recombination between 
WECoV HKU16 and MRCoV HKU18; QuaCoV UAE-HKU30 from recombination 
between PorCoV HKU15/SpCoV HKU17 and MunCoV HKUI3, and PorCoV 
HKUIS from recombination between SpCoV HKU17 and BuCoV HKUII. Birds in 
the Middle East are hosts for diverse deltacoronaviruses with potential for interspecies 


transmission. 
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IMPORTANCE 

During an attempt to explore the diversity of deltacoronaviruses among mammals and 
birds in Dubai, four novel deltacoronaviruses were detected in fecal samples from 
eight birds of four different species: FalCoV UAE-HKU27 from a falcon, HouCoV 
UAE-HKU28 from a houbara bustard, PiCoV UAE-HKU29 from a pigeon and 
QuaCoV ЏАЕ-НКОЗО from five quails. Genome analysis revealed evidence of recent 
interspecies transmission between falcons and their preys, houbara bustards and 
pigeons possibly along the food chain, as well as avian-to-swine transmission. 
Recombination, which is known to occur frequently in some coronaviruses, was also 
common among these deltacoronaviruses and predominantly occurred at the spike 
region. Such recombination, involving the receptor binding protein, may contribute to 
the emergence of new viruses capable of infecting new hosts. Birds in the Middle East 


are hosts for diverse deltacoronaviruses with potential for interspecies transmission. 
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INTRODUCTION 
Coronaviruses (CoVs) infect humans and a wide variety of animals, causing 
respiratory, enteric, hepatic, and neurological diseases of varying severity. Based on 
genotypic and serological characterization, CoVs were traditionally divided into three 
distinct groups (1-3). In 2008, the Coronavirus Study Group of the International 
Committee for Taxonomy of Viruses (ICTV) has replaced the traditional groups 1, 2 
and 3 CoVs with three genera, Alphacoronavirus, Betacoronavirus апа 
Gammacoronavirus (4). As a result of the unique mechanism of viral replication, 
CoVs have a high frequency of recombination (1). Their tendency for recombination 
and the inherently high mutation rates typical of RNA viruses may allow rapid 
adaptation to new hosts and ecological niches (5—9). 

The severe acute respiratory syndrome (SARS) epidemic and the discovery of 
SARS coronavirus (SARS-CoV)-like viruses from palm civets in China have boosted 
interests in discovery of novel CoVs in both humans and animals (10—15). In 2004, a 
novel human CoV (HCoV) of the genus Alphacoronavirus, human coronavirus NL63 
(HCoV NL63), was reported independently by two groups in 2004 (16, 17). In 2005, 
we also described the discovery of another novel HCoV, human coronavirus HKUI 
(HCoV HKU1), under the genus Betacoronavirus (18—20). As for animal CoVs, we 
and others have described the discovery of SARS-CoV-like viruses in horseshoe bats 
in Hong Kong and other provinces of China (21, 22). Based on these findings, we 
expanded molecular surveillance studies to examine the diversity of CoVs in bats of 
southern China, during which at least nine other novel CoVs were discovered, 
including two novel lineages in Betacoronavirus, lineage C and D (23—25). Other 


novel CoVs in bats and other animals have also been discovered by various research 
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groups, which has broadened our knowledge on the diversity and evolution of CoVs 
(7, 26—29). 

Birds are the reservoir of major emerging viruses, most notably, avian 
influenza viruses (30). Due to their flocking behavior and abilities to fly over long 
distances, birds have the potential to disseminate emerging viruses efficiently among 
themselves and to other animals and human. Yet, the number of known CoVs in birds 
is relatively small as compared to bats. In 2009, we described the discovery of three 
novel CoVs in three families of birds, named bulbul coronavirus HKU11 (BuCoV 
HKU11), thrush coronavirus HKU12 (ThCoV HKUI2) and munia coronavirus 
HKU13 (MunCoV HKU13) (31). These three CoVs formed a unique group of CoV, 
which were subsequently classified as a novel genus of CoV, Deltacoronavirus (4). 
Recently, we have further discovered seven additional deltacoronaviruses: one from 
pigs, named porcine coronavirus HKU15 (PorCoV HKUI5), and six from birds, 
named white-eye coronavirus HKU16 (WECoV HKUI16), sparrow coronavirus 
HKU17 (SpCoV HKUI7), magpie robin coronavirus HKU18 (MRCoV HKUIS), 


night-heron coronavirus HKU19 (NHCoV HKU19), wigeon coronavirus HKU20 


(WiCoV HKU20) and common-moorhen coronavirus HKU21 (CMCoV HKU21) (32). 


Subsequently, PorCoV HKUI5 was found to be present widely in pigs and Asia and 
North America, and has been associated with fatal outbreaks in pig farms (33—42). 
The findings supported that deltacoronaviruses have the potential for avian-to- 
mammalian transmission and emergence in mammals. We hypothesize that there are 
other previously unrecognized deltacoronaviruses present in geographical locations 
other than Hong Kong, which may also have the potential for emergence. To test this 
hypothesis, we carried out a molecular epidemiology study in 1,356 mammals and 


birds in Dubai, The United Arab Emirates in the Middle East. Based on the results of 
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comparative genome and phylogenetic analyses, we propose four novel CoVs in 


Deltacoronavirus. The results of sero-epidemiological studies were also discussed. 
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RESULTS 

Animal surveillance and identification of four novel deltacoronaviruses. A total 
of 1,356 fecal samples from 1,164 mammals and 192 birds were obtained (Table 1). 
Reverse transcription-polymerase chain reaction (RT-PCR) for а 440-bp fragment in 
the RNA-dependent-RNA polymerase (RdRp) genes of CoVs was positive in 
specimens from eight birds of four species. Sequencing results suggested the presence 
of four novel deltacoronaviruses: the first (falcon CoV [FalCoV] UAE-HKU27]) from 
one falcon (family Falconidae), the second (houbara CoV [HouCoV] UAE-HKU28) 
from one houbara bustard (family Otididae), the third (pigeon CoV [PiCoV] UAE- 
HKU29) from one pigeon (family Columbidae) and the fourth (quail CoV [QuaCoV] 
UAE-HKU30) from five quails (family Phasianidae) (Table 1, Fig. 1). None of the 
1,164 mammals tested were positive. 

Genome organization and coding potentials of the four novel 
deltacoronaviruses. The complete genome sequences of one strain each of FalCoV 
UAE-HKU27, HouCoV UAE-HKU28 апа PiCoV UAE-HKU29 and two strains of 
QuaCoV UAE-HKU30 were obtained by assembly of the sequences of RT-PCR 
products from the RNA extracted from the corresponding individual specimens. 

The genome sizes of the four novel CoVs ranged from 25,871 (QuaCoV UAE- 
HKU30 strain 1101F) to 26,162 bases (FalCoV UAE-HKU27 strain 988F) and their 
С + С contents ranged from 0.39 to 0.42 (Table 2). Their genome organizations are 
typical of CoVs, with the gene order 5'-replicase ORF lab, spike (S), envelope (E), 
membrane (М), nucleocapsid (М)-3' (Fig. 2 and Table 3). Both 5’ and 3” ends contain 
short untranslated regions. The replicase ORFlab occupies 18.363 to 18.678 kb of the 
genomes (Table 3). This open reading frame (ORF) encodes a number of putative 


proteins, including nsp3 (which contains the putative papain-like protease [РГР“]), 
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nsp5 (putative chymotrypsin-like protease ІЗСІ.? |), nsp12 (putative RdRp), nsp13 
(putative helicase) and other proteins of unknown functions. Overall, the cleavage 
sites for the non-structural proteins in ORFlab of FalCoV UAE-HKU27, HouCoV 
UAE-HKU28, PiCoV UAE-HKU29 and QuaCoV UAE-HKU30 were similar to other 
deltacoronaviruses, except for nsp3/nsp4 and nspl5/16 in FalCoV UAE-HKU27, 
HouCoV UAE-HKU28 апа PiCoV UAE-HKU29 (Table 4). In fact, the amino acids 
downstream to the putative cleavage site at nsp3/nsp4 are quite variable across 
different deltacoronaviruses (Table 4). As for the amino acids downstream to the 
putative cleavage site at nsp15/nsp16, they аге AL instead of SL (Table 4). 

The four novel CoVs display a similar genome organization and differ only in 
the number of ORFs downstream of N (Fig. 2). Their transcription regulatory 
sequences (TRSs) conform to the consensus motif 5'-ACACCA-3' (Table 3), which is 
unique to deltacoronaviruses. Interestingly, similar to other deltacoronaviruses, the 
perfect TRS of S in the genomes of the four novel CoVs were separated from the 
corresponding AUG by a large number (140) of bases (Table 3). This is in contrast to 
the relatively small number of bases between the TRS for S and the corresponding 
AUG (range: 0 base in HCoV NL63, Rhinolophus bat coronavirus HKU2 [Rh- 
BatCoV HKU2], HCoV HKUI, bovine coronavirus [BCoV], human coronavirus 
OC43 [HCoV OC43], mouse hepatitis virus [MHV], porcine hemagglutinating 
encephalomyelitis virus, SARS-CoV and SARS-related Rhinolophus bat coronavirus 
HKU3 [SARSr-Rh-batCoV HKU3] to 52 bases in infectious bronchitis virus ПВУ)) 
in alphacoronaviruses, betacoronaviruses and gammacoronaviruses. Similar to other 
deltacoronaviruses, the genomes of the four novel CoVs contain putative РІ", which 
are homologous to PL2P? of alphacoronaviruses and betacoronavirus subgroup А and 


PLP? of betacoronavirus subgroups B, С and D and gammacoronaviruses. Besides, 
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one ORF (NS6) is found between М and М of the genomes of the four novel CoVs. In 
all the four novel CoVs, one ORF (NS7a) is present overlapping with N. For FalCoV 
UAE-HKU27, HouCoV UAE-HKU28 апа PiCoV UAE-HKU29, three ORFs (NS7b, 
Тс and 7d), and for QuaCoV UAE-HKU30, two ORFs (NS7b and 7c) are present 
downstream of N. BLAST search revealed no amino acid similarities between these 
putative non-structural proteins and other known proteins and no functional domain 
was identified by PFAM and InterProScan, except that NS7a of FalCoV UAE- 
HKU27, HouCoV UAE-HKU28 апа PiCoV UAE-HKU29 were homologous to NS7a 
of WECoV HKU16, NS7b homologous to NS7b of CMCoV HKU21 and NS7c 
homologous to NS7a of ThCoV HKUI2, and that NS7a, NS7b and NS7c of QuaCoV 
UAE-HKU30 were homologous to NS7a, NS7b апа NS7c of SpCoV HKUIT7, 
respectively. Transmembrane helices, predicted by TMHMM, were found only in 
NS7c of FalCoV UAE-HKU27, HouCoV UAE-HKU28 and PiCoV UAE-HKU29 at 
positions 4 to 23 and 30 to 52 among all the putative accessory proteins downstream 
to the N genes of FalCoV UAE-HKU27, HouCoV UAE-HKU28, PiCoV UAE- 
HKU29 and QuaCoV UAE-HKU30. Each of the genomes of FalCoV UAE-HKU27, 
HouCoV UAE-HKU28, PiCoV UAE-HKU29 and QuaCoV UAE-HKU30 contains а 
stem-loop II motif (s2m) (residues 25,924 to 25,965, 25,924 to 25,965, 25,932 to 
25,973 and 26,649 to 26,690, respectively), a conserved RNA element downstream of 
N and upstream of the polyA tail, similar to those in IBV, TCoV, SARSr-Rh-BatCoV, 
SARS-CoV as well as other deltacoronaviruses. 

Comparison of the amino acid identities of the seven conserved replicase 
domains for species demarcation (ADRP, nsp5 [СТР], nsp12 [RdRp], nsp13 [Hel], 
nsp14 [ExoN], nsp15 [NendoU] and nsp16 (О-МТ)) (4) among the four novel CoVs 


is shown in Tables 5A and 5B. In all the seven domains, the amino acid sequences of 
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FalCoV UAE-HKU27, HouCoV UAE-HKU28 and PiCoV UAE-HKU29 showed 
more than 90% identity among each other, but their overall amino acid sequences 
showed only 84.4% identity to those of WECoV HKU16, indicating that these three 
CoVs should be subspecies of a novel CoV species (Table 5A). As for QuaCoV UAE- 
HKU30, its overall amino acid sequences showed more than 90% identity to those of 
PorCoV HKUI5 апа SpCoV HKUI17, indicating that QuaCoV UAE-HKU30, 
PorCoV HKUI15 and SpCoV HKUI7 should be subspecies of the same CoV species 
(Table 5B). 

Phylogenetic analyses. The phylogenetic trees reconstructed using the 
nucleotide sequences of ЗСГР“, RdRp, Hel, S and N of the four novel CoVs and other 
deltacoronaviruses are shown in Fig. 3 and the corresponding pairwise amino acid 
identities shown in Table 2. In all five trees, FalCoV UAE-HKU27, HouCoV UAE- 
HKU28 and PiCoV UAE-HKU29 were clustered together (Fig. 3). In the 3CL?”, 
RdRp, Hel and N trees, FalCoV UAE-HKU27, HouCoV UAE-HKU28 and PiCoV 
UAE-HKU29 were clustered with WECoV HKU16, whereas QuaCoV UAE-HKU30 
was clustered with PorCoV HKU15 and SpCoV HKU17 (Fig. 3). However, in the S 
tree, FalCoV UAE-HKU27, HouCoV UAE-HKU28 and PiCoV UAE-HKU29 were 
most closely related to MRCoV HKU18, whereas QuaCoV UAE-HKU30 was most 
closely related to MunCoV HKU13 (Fig. 3). 

Recombination analysis. For the FalCoV UAE-HKU27, HouCoV UAE- 
HKU28 апа PiCoV UAE-HKU29 cluster and QuaCoV UAE-HKU30, bootscan 
analysis showed possible recombination sites in the S gene (positions 20,300—24,300 
for the FalCoV UAE-HKU27, HouCoV UAE-HKU28 and PiCoV UAE-HKU29 


cluster [Fig. 4A] and positions 19,900—23,300 for QuaCoV UAE-HKU30 [Fig. 4B ]). 
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Estimation of divergence dates. Using the Bayesian Skyline under a relaxed 
clock model with an uncorrelated lognormal distribution, the mean evolutionary rate 
of CoVs was estimated as 1.027 x 10“ nucleotide substitutions per site per year for 
the RdRp gene. Molecular clock analysis using the RdRp gene showed that the mean 
time to the most recent common ancestor (МЕСА) of Deltacoronavirus was 
estimated to be April 82 (95% highest posterior density [HPD], 822 BC to 1824), that 
of HouCoV UAE-HKU28/PiCoV UAE-HKU29 to be March 2011 (95% HPD, 
October 2009 to February 2013), that of FalCoV UAE-HKU27/HouCoV UAE- 
HKU28/PiCoV UAE-HKU29 to be June 1981 (95% HPD, January 1964 to June 2010) 
and that of QuaCoV UAE-HKU30 to be February 1954 (95% HPD, October 1921 to 
September 2007) (Fig. 5). 

Western blot analysis. Prominent immunoreactive bands were visible for 33 
(7596) of 44 falcon serum samples. The band size observed (42 kDa) was consistent 
with the expected size of 41.1 kDa for the full-length Hisg-tagged recombinant N 


protein (Fig. 6). 
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DISCUSSION 
Similar to birds in Hong Kong, birds in the Middle East are also hosts for a diversity 
of deltacoronaviruses. In 2009, we reported the discovery of three novel avian CoVs 


that were phylogenetically distinct from infectious bronchitis virus (31). These three 


CoVs were found in fecal samples of bulbuls, thrushes and munias in Hong Kong (31). 


In 2011, the ICTV approved the classification of these three avian CoVs as a novel 
genus Deltacoronavirus in the Coronaviridae family (4). In 2012, in a large 
epidemiological study, we discovered seven additional deltacoronaviruses (32). Six of 
them were from fecal samples of Japanese white eyes, tree sparrows, original magpie 
robins, black-crowned night herons, Eurasian wigeons, and common moorhens 
respectively and one from fecal samples of pigs in Hong Kong (32). In the last few 
years, PorCoV HKU15 (now officially named Coronavirus HKU15) has been widely 
detected in fecal samples of pigs in Canada, China, Laos, Mexico, South Korea, 
Thailand, Vietnam and the USA (33-41). Recently, we have also found PorCoV 
HKUIS in respiratory samples of pigs which may have implications on the possible 
routes and sites of infections (42). In this study, four additional novel 
deltacoronaviruses were detected in the fecal samples of falcons, houbara bustards, 
pigeons and quails in Dubai. Similar to the other deltacoronaviruses, FalCoV UAE- 
HKU27, HouCoV UAE-HKU28, PiCoV UAE-HKU29 and QuaCoV UAE-HKU30 
also have large genome sizes of 25,871 to 26,162 bases, owing to the presence of 
multiple ORFs downstream to the N gene. Continuous surveillance studies on birds in 
the Middle East and other regions will help better understand the viral and host 
diversity of deltacoronaviruses, and their potential for emergence in mammals. 

Recent interspecies jumping events were observed in the deltacoronaviruses 


from falcons, houbara bustards and pigeons, which were likely a result of predator- 
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and-prey relationship along the food chain. According to Фе ІСТУ definition for 
demarcation of CoV species, where CoVs that share an overall amino acid identity of 
more than 90% in their seven conserved replicase domains (ADRP, nsp5 [3СТР“], 
nsp12 [RdRp], nsp13 [Hel], пзр14 [ExoN], nsp15 [NendoU] and пзр16 [O-MT]) 
should be regarded as the same species, FalCoV UAE-HKU27, HouCoV UAE- 
HKU28 and PiCoV UAE-HKU29 should be subspecies of a novel CoV species. 
Notably, the S proteins of FalCoV UAE-HKU27, HouCoV UAE-HKU28 and PiCoV 
UAE-HKU29, which are responsible for CoV receptor binding, also shared 94.5— 
99.8% amino acid identities, suggesting that the viruses have not evolved much yet to 
adapt to the corresponding avian host after jumping from one species to another. In 
fact, molecular clock analysis estimated that the tMRCA of HouCoV UAE-HKU28 
and PiCoV UAE-HKU29 was just around seven years ago and that of FalCoV UAE- 
НКО27, HouCoV UAE-HKU28 and PiCoV UAE-HKU29 was just around 38 years 
ago (Fig. 5). Nevertheless, it should be noted that results of molecular clock 
estimation are only speculative for RNA viruses which are known for episodic 
evolution and adaptation to different environments (43). It 1s interesting to note that 
falcons, houbara bustards and pigeons are three radically different types of birds with 
unique behaviors and habitats. Falcons (order Falconiformes, family Falconidae) are 
medium-sized birds of prey traditionally used for hunting wild quarry in the Arabian 
region. Houbara bustards (order Otidiformes, family Otididae) are large birds that are 
geographically restricted to arid habitats. Pigeons (order Columbiformes, family 
Columbidae) are relatively smaller and are globally distributed. Yet, these birds are 
ecologically closely related because falcons are known predators of pigeons and 
houbara bustards, and are also fed with the meat of these birds (44, 45). Moreover, 


falcons are trained by Arabian falconers to hunt houbara bustards because Arabs and 
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other Asians like Pakistanis believe their meat possesses aphrodisiac qualities and the 
meat is sold at high prices, despite banning of such controversial practices in some 
countries. Therefore, interspecies transmission of this CoV species is likely a result of 
the predator-and-prey relationship. Serological testing confirmed the presence of 
specific antibodies against FalCoV UAE-HKU27 in 75% of field falcon sera collected 
in Dubai. Such a relatively high seroprevalence is similar to those observed in other 
coronaviruses found in other animals, such as MERS-CoV and dromedary CoV UAE- 
HKU23 іп dromedaries and rabbit CoV HKUIA in rabbits (7, 46), supporting 
widespread genuine infection among the falcon population and excluding the 
possibility of remnant viruses in falcon fecal samples resulting from ingestion of 
pigeons and houbara bustards. The detection of viruses from only one pigeon was not 
surprising, as viral shedding is often transient during the acute infection phase. 
Further molecular studies will reveal whether these three closely related 
deltacoronaviruses share a common receptor in these three phylogenetically distant 
birds, where viral adaptation to an evolutionarily conserved host-cell receptor might 
help offer facile interspecies transmissibility (47). 

Recombination involving the S protein is likely a common phenomenon 
among deltacoronaviruses, which may facilitate interspecies transmission and 
adaptation to new animal hosts. Phylogenetic analysis showed that the CoV species 
comprising FalCoV UAE-HKU27, HouCoV UAE-HKU28 and PiCoV UAE-HKU29 
was most closely related to WECoV HKU16 in the 3CL??, RdRp, Hel and N genes, 
but was only distantly related to WECoV HKU16 and most closely related to MRCoV 
HKUIS in the S gene, suggesting recombination events around the S gene region as 
demonstrated by bootscan analysis (Fig. 4). As for QuaCoV UAE-HKU30, PorCoV 


НКО15 and SpCoV HKUIT7, according to the ICTV definition for demarcation of 
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CoV species, these three CoVs should be classified as subspecies of the same CoV 
species. However, in contrast to FalCoV UAE-HKU27, HouCoV UAE-HKU28 and 
PiCoV UAE-HKU29 which are clustered together in all phylogenetic trees, QuaCoV 
UAE-HKU30, PorCoV HKUI5 and SpCoV HKUI7 showed phylogenetic positions 
shifting in different phylogenetic trees. QuaCoV UAE-HKU30 was clustered with 
PorCoV HKU15 апа SpCoV HKUT7 in the ЗСТР“, RdRp, Hel and N genes, whereas 
it was most closely related to MunCoV-HKU13 in the S gene (Fig. 3). Recombination 
around the S gene region was also demonstrated by bootscan analysis (Fig. 4). 
Similarly, PorCoV HKUI5 and/or Asian leopard cat CoV (ALCCoV) were most 
closely related to SpCoV HKUI7 in the 3CL”°, RdRp, Hel and N genes, but were 
only distantly related to SpCoV HKUI7 and most closely related to BuCoV HKU11 
in the S gene. This suggests that these mammalian deltacoronaviruses may have 
arisen from recombination events between SpCoV HKUI7 and BuCoV HKUII or 
related viruses. Recombination is not uncommon in other CoVs, and was found to be 
responsible for the emergence of SARS-CoV (48—51), generation of new genotypes or 
strains of other CoVs including human CoV HKUI, human CoV ОСАЗ and feline 
CoV type II strains (5, 6, 52). The present results suggest that recombination is also 
common among deltacoronaviruses, with the S gene being a frequent recombination 
site. Further studies may reveal if this may be an important mechanism for 
overcoming the mammalian species barrier through more efficient receptor binding to 


swine or other mammalian cells in PorCoV HKU15 and related viruses. 
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MATERIALS AND METHODS 

Ethical statement. Collection of animal fecal samples and field falcon sera in this 
study were approved by the Animal Ethic Committee of Central Veterinary Research 
Laboratory and Ministry of Climate Change and Environment, UAE according to the 
Ministerial Decree No. 384 of year 2008 on the executive by-law of the Federal Law 
No. 16 of year 2007 concerning animal welfare. All the experimental procedures were 
performed in accordance with the International Guiding Principles for Biomedical 
Research Involving Animals regarding the care and use of animals. 

Animal surveillance and sample collection. All animal fecal samples were 
left-over specimens submitted to the Central Veterinary Research Laboratory in Dubai, 
The United Arab Emirates for pathogen screening over a 24-month period (January 
2013 to December 2014). A total of 1,356 fecal samples from 1,164 mammals and 
192 birds were tested (Table 1). Serum samples from falcons in the field were 
collected in Dubai over а 4-month period (November 2015—February 2016). АП 
samples were taken from the caudal tibial vein (vena metatarsalis plantaris 
superficialis) under isoflurane-anesthesia in serum tubes, centrifuged and stored at — 
20°C until use. 

RNA extraction. Viral RNA was extracted from the fecal samples using 
RNeasy Mini Кїї (Qiagen, Germany). The RNA was eluted in 50 ul of RNase-free 
water and was used as the template for RT-PCR. 

RT-PCR of RdRp gene of CoVs using deltacoronavirus-conserved 
primers and DNA sequencing. Initial CoV screening was performed by amplifying a 
440-bp fragment of the RdRp gene of CoVs using deltacoronavirus-conserved primers 
(LPW16472 5'-GTGGVTGTMTTAATGCACAGTC-3' and LPW 16473 5- 


TACTGYCTGTTRGTCATRGTG-3') we published previously (32). Reverse 
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transcription was performed using the SuperScript II reverse transcriptase 
(Invitrogen, Carlsbad, CA). The PCR mixture (25 ul) contained cDNA, PCR buffer 
(10 mM Tris-HCl pH 8.3, 50 mM KCI, 3 mM MgCl, and 0.01% gelatin), 200 uM of 
each dNTPs and 1.0 U Taq polymerase (Applied Biosystem, Foster City, CA). The 
mixtures were amplified in 60 cycles of 94°C for 1 min, 48°C for 1 min and 72°C for 
1 min and a final extension at 72°C for 10 min in an automated thermal cycler 
(Applied Biosystem). Standard precautions were taken to avoid PCR contamination 
and no false-positive was observed in negative controls. 

The PCR products were gel-purified using the QIAquick Gel Extraction Кїї 
(Qiagen). Both strands of the PCR products were sequenced twice with the ABI Prism 
3130х/ Genetic Analyzer (Applied Biosystems), using the two PCR primers. The 
sequences of the PCR products were compared with known sequences of the КаКр 
genes of CoVs in the DDBJ/ENA/GenBank sequence databases. A phylogenetic tree 
was reconstructed using 371-Ыр fragments of the RdRp gene with the maximum 
likelihood method using the substitution model general time reversible with gamma 
distributed rate variation as well as estimated proportion of invariable sites 
(СТК--С-Н) by PhyML.3.0. 

Complete genome sequencing. One complete genome each of FalCoV UAE- 
HKU27, HouCoV UAE-HKU28 апа PiCoV UAE-HKU29 and two complete 
genomes of QuaCoV UAE-HKU30 were amplified and sequenced using the RNA 
extracted from the original fecal specimens as templates. The RNA was converted to 
cDNA by a combined random-priming and oligo(dT) priming strategy. The cDNA 
was amplified by degenerate primers designed by multiple alignments of the genomes 
of other CoVs with complete genomes available, using strategies described in our 


previous publications (7, 18, 29). Additional primers were designed from the results 


18 


Аисләлїиг| [eoipejy Bunisuoey Aq 8102 ‘81 Aen uo /B10 шелуу: ац шолу рореојимод 


Ф 
= 
6 
го 

Ф 

— 

(72) 

О 
jae 

— 

о. 
= 

(9) 

(72) 

=) 

(= 
5 
го 

e 

[ox 

(15) 

(9) 

9) 
< 


Journal of Virology 


Journal of Virology 





380 


381 


382 


383 


384 


385 


386 


387 


388 


389 


390 


391 


392 


393 


394 


395 


396 


397 


398 


399 


400 


401 


402 


403 


404 


of the first and subsequent rounds of sequencing. The 5’ ends of the viral genomes 
were confirmed by rapid amplification of cDNA ends using the SMARTer 573' 
RACE kit (Clontech, Mountain View, CA). Sequences were assembled and manually 
edited to produce final sequences of the viral genomes. 

Genome analysis. The nucleotide sequences of the genomes and the deduced 
amino acid sequences of the ORFs were compared to those of other CoVs using 
ORFfinder (https://www.ncbi.nlm.nih.gov/orffinder/). Phylogenetic tree 
reconstruction was performed using the maximum likelihood (ML) method with 
PhyML3.0. The best-fit substitution models were selected using PhyML with Smart 
Model Selection. Protein family analysis was performed using PFAM and 
InterProScan (53, 54). Prediction of transmembrane domains was performed using 
TMHMM (55). 

Recombination analysis. To detect possible recombination events, bootscan 
analysis was performed by using the nucleotide alignment of the genome sequences of 
the novel deltacoronaviruses with Simplot version 3.5.1, as previously described 
(56). The analysis was conducted using a sliding window of 1,000 nucleotides moving 
in 200 nucleotide steps with 1,000 bootstrap values. Possible recombination sites 
suggested by the bootscan analysis were confirmed through multiple sequence 
alignments. 

Estimation of divergence dates. Divergence times for the genus 
Deltacoronavirus was calculated using a Bayesian Markov Chain Monte Carlo 
(MCMC) approach as implemented in BEAST (Version 1.7.4) as described 
previously (50, 57). RdRp gene sequence data were selected for analyses under the 
substitution model GTR+G+I using an unrelaxed log-normal distributed (Ucld) 


relaxed molecular clock and a Bayesian Skyline coalescent model. MCMC run was 5 
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x 107 steps long, sampling every 1,000 steps. Convergence was assessed on the basis 
of the effective sampling size after a 10% burn-in using Tracer version 1.6. МЕСА 
and the HPD regions at 95% were calculated. The trees were summarized in a target 
tree by the Tree Annotator program included in the BEAST package by choosing the 
tree with the maximum sum of posterior probabilities (maximum clade credibility) 
after a 10% burn-in. 

Cloning and purification of His-tagged recombinant N protein of FalCoV 
UAE-HKU27. Cloning and purification of the His-tagged recombinant N protein of 
FalCoV UAE-HKU27 was performed using a protocol we described previously (46, 
58). To produce a plasmid for protein purification, primers LPW36358 (5'- 
ССААТТССАТАТСАТСАССАСТСССАСАСТСССТ-3) апа LPW36359 (5- 
CCGCTCGAGATGCAGTTGAATCTCCATCCTG-3’) were used to amplify the gene 
encoding the N protein of FalCoV UAE-HKU27 by RT-PCR. The sequence coding 
for amino acid residues 1 to 344 of the N protein was amplified and cloned into the 
Ndel and Xhol sites of the expression vector pET-28b(+) (Merck, Germany) in-frame 
and upstream of the histidine residue series. The recombinant N protein was 
expressed and purified using Ni-NTA agarose (Qiagen) according to the 
manufacturer’s instructions. 

Western blot analysis. Western blot analysis was performed according to our 
published protocol (46, 58). Briefly, 600 ng of purified His¢-tagged recombinant N 
protein of FalCoV UAE-HKU27 was loaded into the well of a sodium dodecyl sulfate 
(SDS)-10% polyacrylamide gel and subsequently electrophoresed апа then 
electroblotted onto a nitrocellulose membrane (Bio-Rad, Hercules, CA). The blot was 
cut into strips and each separated strip was incubated with an individual serum sample, 


in a dilution of 1:1,000, obtained from falcons in Dubai. Antigen-antibody interaction 
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was detected with an in-house developed polyclonal guinea pig anti-falcon IgY 
antibody (59) at a dilution of 1:2,000, a horseradish peroxidase (HRP)-conjugated 
rabbit anti-guinea pig IgG antibody (Invitrogen) at a dilution of 1:4,000, and the 
WesternBright Quantum HRP substrate (Advansta, USA). Monoclonal anti-Hisg 
antibody (clone HIS.H8; Invitrogen) at a concentration of 0.5 u g/mL, with HRP- 
conjugated goat anti-mouse IgG antibody (Invitrogen) at a dilution of 1:4,000 as the 
secondary antibody, was used as the positive control. 

Nucleotide sequence accession numbers. The nucleotide sequences of the 
five complete genomes of FalCoV UAE-HKU27, HouCoV UAE-HKU28, PiCoV 
UAE-HKU29 and QuaCoV UAE-HKU30 have been lodged into the 
DDBJ/ENA/GenBank sequence databases with the accession numbers І/С364342- 


LC364346. 
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FIG. 1. Phylogenetic analysis of amino acid sequences of the 371-bp fragment 
(excluding primer sequences) of RNA-dependent RNA polymerase (RdRp) of 
coronaviruses (CoVs) identified from birds from Dubai in the present study. The tree 
was reconstructed by the maximum-likelihood method using PhyML 3.0 with the 
substitution model general time reversible with gamma distributed rate variation and 
estimated proportion of invariable sites (GTR+G+I). Bootstrap values were calculated 
from 1,000 trees. The scale bar indicates the number of nucleotide substitutions per 
site. The eight newly identified coronaviruses are shown in bold. List of viruses and 
their respective DDBJ/ENA/GenBank accession numbers are as follow: ALCCoV, 
Asian leopard cat coronavirus (EF584908); Badger SARS-CoV, SARS-related 
Chinese ferret badger CoV (AY545919); BCoV, bovine CoV (NC_003045); BdCoV 
HKU22, bottlenose dolphin CoV HKU22 (КЕ793826); BuCoV HKU11, bulbul CoV 
HKUI1 (FJ376619); BWCoV SWI, Beluga whale CoV SW1 (МС 010646); Camel 
MER-CoV, Camel Middle East respiratory syndrome CoV (KT751244); ChRCoV 
НКО24, China Rattus CoV HKU24 (KM349742); Civet SARS-CoV, SARS-related 
palm civet CoV (AY304488); CMCoV HKU21, common-moorhen CoV HKU21 
(NC_016996); DcCoV HKU23, dromedary camel CoV HKU23 (KF906251); FalCoV 
UAE-HKU27, falcon CoV UAE-HKU27; FIPV, feline infectious peritonitis virus 
(AY994055); GiCoV, giraffe CoV (EF424622); HouCoV UAE-HKU28, houbara 
CoV UAE-HKU28; HCoV 229E, human CoV 229Е (МС 002645); HCoV HKUI, 
human CoV HKUI (NC_006577); HCoV NL63, human CoV NL63 (МС 005831); 
HCoV OC43, human CoV OC43 (NC_005147); Human MERS-CoV, human Middle 
East respiratory syndrome CoV (JX869059); Human SARS-CoV, severe acute 


respiratory syndrome-related human CoV (МС 004718); IBV, infectious bronchitis 
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virus (NC_001451); IBV-partridge, partridge coronavirus (А Ү646283); IBV-peafowl, 
peafowl coronavirus (AY641576); MHV, murine hepatitis virus (МС 001846); 
MRCoV HKU18, magpie robin CoV HKU18 (МС 016993); MunCoV HKUI3, 
munia CoV HKUI3 (FJ376622); NHCoV HKU19, night-heron CoV HKUI9 
(NC_016994); PEDV, porcine epidemic diarrhea virus (NC_003436); PHEV, porcine 
hemagglutinating encephalomyelitis virus (NC_007732); PiCoV UAE-HKU29, 
pigeon CoV UAE-HKU29; Pi-BatCoV НКО5, Pipistrellus bat CoV HKUS5 
(МС 009020); PorCoV HKU15, porcine CoV HKU15 (МС_016990); PRCV, porcine 
respiratory CoV (DQ811787); QuaCoV UAE-HKU30, quail CoV UAE-HKU30; 
RbCoV HKUIA, rabbit CoV HKU14 (JN874559); Rh-BatCoV HKU2, Rhinolophus 
bat CoV HKU2 (EF203064); Ro-BatCoV HKU9, Rousettus bat CoV HKU9 
(NC_009021); SACoV, sable antelope CoV (EF424621); SARSr-Rs-BatCoV HKU3, 
SARS-related Rhinolophus bat CoV HKU3 (00022305); Sc-BatCoV 512, 
Scotophilus bat CoV 512 (NC 009657); SpCoV HKU17, sparrow CoV HKUI7 
(NC_016992); TGEV, transmissible gastroenteritis virus (NC_002306); ThCoV 
HKU12, thrush CoV HKU12 (FJ376621); Ty-BatCoV HKU4, Tylonycteris bat CoV 
HKU4 (NC 009019); WECoV HKU16, white-eye CoV HKUI6 (NC_016991); 
WiCoV HKU20, wigeon CoV HKU20 (NC_016995). 

FIG. 2. Genome organization of members of Deltacoronavirus. Open reading frames 
downstream of spike (S) gene are magnified to show the differences among the 
genomes of the 10 CoVs. Papain-like protease (РІ), chymotrypsin-like protease 
(СТР), and RNA-dependent RNA polymerase (RdRp) genes are represented by 
green boxes. S, envelope (E), membrane (M) and nucleocapsid (N) genes are 


represented by orange boxes. Putative accessory proteins are represented by blue 
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boxes. The novel coronaviruses discovered in this study are shown in bold. 
Abbreviations for the viruses are the same as those in Fig. 1. 

FIG. 3. Phylogenetic analyses of chymotrypsin-like protease (3CL”°), RNA- 
dependent RNA polymerase (RdRp), helicase (Hel) spike (S) protein апа 
nucleocapsid (М) protein of Falcon CoV-HKU27, Houbara CoV-HKU28, Pigeon 
CoV-HKU29 and Quail CoV-HKU30. The trees were reconstructed by the maximum- 
likelihood method using PhyML 3.0 with the substitution models Le and Gascuel (LG) 
with gamma distributed rate variation (С) (ЗСІ"); LG with С, estimated proportion 
of invariable sites (I) and empirical frequencies (F) (RdRp); LG+G+F (Hel апа М); 
and Whelan and Goldman (WAG)+G+I+F (S) with bootstrap values calculated from 
1,000 trees. 314, 944, 599, 1561 апа 392 amino acid positions in ЗСІ?, RdRp, Hel, 
S and N, respectively, were included in the analyses. The scale bars indicate the 
number of amino acid substitutions per site. Viruses characterized in this study are in 
bold. Abbreviations for the viruses are the same as those in Fig. 1. 

FIG. 4. Detection of possible recombination by bootscan analysis. Bootscanning was 
conducted with Simplot version 3.5.1 (F84 model; window size, 1,000 bp; step, 
200 bp). (A) Falcon CoV UAE-HKU27 (FalCoV UAE-HKU27) was used as the 
query sequence and compared with the genome sequences of white-eye coronavirus 
HKU16 (WECoV HKU 16), magpie robin coronavirus HKU18 (MRCoV HKUI18) and 
ThCoV HKU12 thrush coronavirus HKU12 (ThCoV HKU12). (B) Quail CoV UAE- 
HKU30 (QuaCoV UAE-HKU30) was used as the query sequence and compared with 
the genome sequences of sparrow coronavirus HKU17 (SpCoV HKUI7), munia 
coronavirus HKU13 (MunCoV HKU13) and ThCoV HKUI2. 

FIG. 5. Estimation of the mean time to the most recent common ancestor (tMRCA) 


for Deltacoronavirus. The time-scaled phylogeny was summarized from all Markov 
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Chain Monte Carlo (MCMC) phylogenies of the RNA-dependent RNA polymerase 
(RdRp) gene data set analyzed under the relaxed-clock model with an uncorrelated 
log-normal distribution in BEAST version 1.7.4. Viruses characterized in this study 
are in bold. Abbreviations for the viruses are the same as those in Fig. 1. 

FIG. 6. Western blot analysis of falcon CoV UAE-HKU27 (FalCoV UAE-HKU27) 
using nucleocapsid (N) protein expressed in Escherichia coli. Lane 1, purified 
recombinant FalCoV UAE-HKU27 М protein; Lane 2, falcon serum sample (FS7) 
strongly positive for antibody against FalCoV UAE-HKU27 N protein; Lane 3: falcon 


serum sample (FS5) negative for antibody against FalCoV UAE-HKU27 М protein. 
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Table 1. Animals screened and their associated coronaviruses (CoVs) in the present 











study 
Animals No. of No. (%) of CoV 
specimens specimens 
tested positive for CoV 
Birds 192 8 (4.16%) 
Black swan 1 0 - 
Crowned crane 1 0 - 
Eclectus parrot 1 0 - 
Falcon 34 1 (2.94%) Falcon CoV UAE-HKU27 (n=1) 
Flamingo 3 0 - 
Grey parrot 1 0 - 
Heuglin’s bustard 10 0 - 
Houbara bustard 36 1 (2.77%) Houbara CoV UAE-HKU28 (п-1) 
Kori bustard 4 0 - 
Myna 1 0 - 
Ostrich 4 0 - 
Peacock 1 0 - 
Pigeon 18 1 (7.14%) Pigeon CoV UAE-HKU29 (n=1) 
Rhea 1 0 - 
Thick-knee 1 0 - 
Stone curlew 22 0 - 
Partridge 8 0 - 
Chicken 19 0 - 
Duck 2 0 - 
Guineafowl 6 0 - 
Pheasant 7 0 - 
Quail 10 5 (50%) Quail CoV UAE-HKU30 (п=5) 
Sand grouse 1 0 - 
Mammals 1,164 0 
Antelope 90 0 - 
Cat 59 0 - 
Cattle 3 0 - 
Camel 754 0 - 
Dog 145 0 - 
Goat 34 0 - 
Horse 44 0 - 
Lion 7 0 - 
Monkey 6 0 - 
Rabbit 9 0 - 
Rodent 13 0 - 
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Table 2. Comparison of genomic features and amino acid identities between the four novel deltacoronaviruses, representative members 
of alpha-, beta- and gammacoronaviruses and other deltacoronaviruses 





CoV Genome size (base) G+C content 


Amino acid identity (%) 





FalCoV UAE-HKU27 


HouCoV UAE-HKU28 


PiCoV UAE-HKU29 


QuaCoV UAE-HKU30 











3С” RdRp Hel S N ЗС” RdRp На S N 3CL™ RdRp Hel S N 3CL™ RdRp Hel S N 
Alphacoronavirus 
HCoV 229E 27,317 0.38 358 491 477 448 214 35.5 490 477 448 2 35.8 49.1 47.7 448 2 348 48.9 506 411 223 
HCoV NL63 27,553 0.34 367 48.9 475 393 198 367 488 475 395 2 370 489 475 394 2 360 48.8 496 377 215 
Betacoronavirus 
Lineage A 
BCoV 31,028 0.37 5 519 472 249 227 365 520 472 249 225 381 521 472 2: 225 372 514 486 254 234 
HCoV HKUI 29,926 0.32 381 516 472 25.5 201 378 516 472 258 201 369 517 472 2 201 381 512 4823 250 213 
Lineage В 
SARS-CoV 29,751 0.41 358 50.2 502 25.9 242 35.5 502 500 245 240 358 503 500 245 240 348 511 51.9 25.0 25.3 
Lineage С 
MERS-CoV 30,119 0.41 507 48.9 25.8 242 364 507 48.9 250 23.5 36. 50.8 489 2 23.5 358 514 5 233. 282 
Ty-BatCoV HKU4 30,286 0.38 3 512 48.9 251 24.2 363 512 48.9 253 241 36. 513 489 2 241 357 506 24.00 238 
Lineage D 
Ro-Bat-CoV HKU9 29,114 0.41 380 525 492 267 243 380 524 490 266 245 383 525 490 266 245 374 517 511 269 242 
Gammacoronavirus 
IBV 27,608 0.38 429 548 534 27.9 282 42.9 540 534 285 284 433 550 534 286 284 446 54.6 561 29.5 29.0 
Deltacoronavirus 
BuCoV НКІЛІ 26,476 0.39 850 888 945 452 715 844 891 947 45.0 721 85.0 889 947 45.2 721 811 87.5 894 681 73.4 
ThCoV HKU12 26,396 0.38 847 86.9 945 46.3 786 847 870 944 47.0 79.1 853 868 944 468 791 824 872 89.9 47.9 79.9 
MunCoV HKUI3 26,552 0.43 77.9 87.6 883 462 741 77.5 876 881 465 744 779 874 881 464 744 840 899 961 730 77.4 
PorCoV HKUI5 25,421 0.43 808 874 881 457 754 801 87.5 880 460 757 808 873 880 459 757 912 952 98.5 71.8 90.9 
МЕСОМ HKUI6 26,027 0.40 866 919 968 471 827 863 919 970 477 836 870 917 970 478 836 785 867 884 622 75.5 
SpCoV НКО17 26,067 0.45 808 87.5 882 673 759 801 87.6 881 679 76.8 808 874 881 677 768 938 958 987 447 918 
MRCoV НКО18 26,674 0.47 788 871 883 686 734 785 871 881 682 736 788 869 881 682 736 857 903 963 450 782 
NHCoV HKUI9 26,064 0.38 583 723 75.9 45.6 504 576 723 762 46.0 507 579 723 762 462 507 537 723 784 418 516 
WiCoV HKU20 26,211 0.39 583 721 73.8 45.6 49.3 577 720 738 45.7 508 583 71.9 738 457 508 590 713 754 43.0 514 
CMCoV HKU21 26,212 0.35 762 848 891 471 64.7 75.9 847 892 465 65.3 765 845 892 465 653 717 83.5 846 515 61.6 
FalCoV UAE-HKU27 26,155 0.39 - - - - - 993 989 99.5 94.5 100 993 987 995 945 991 818 863 883 458 742 
HouCoV UAE-HKU28 26,155 0.39 987 98.9 99.5 945 991 - - - - - 993 998 100 998 100 811 864 881 460 748 
PiCoV UAE-HKU29 26,162 0.39 993 987 99.5 945 991 987 998 100 99.8 991 - - - - - 818 862 881 461 748 
QuaCoV UAE-HKU30 25,871 0.42 80.8 863 88.3 45.8 748 801 864 881 460 748 808 862 881 461 748 - - - - - 
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788 Table 3. Coding potential and putative transcription regulatory sequences of novel 
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790 
CoVs ORF Location Frame Length (aa) Length (nt) TRS location TRS sequence 
(distance in bases to AUG) 
FalCoV UAE-HKU27 lab 595-19,271 +1,+3 6226 18,678 71 ACACCA (523) AUG 
5 19,253-22,855 +2 1201 3,603 19112 АСАССА (140) АСС 
Е 22,849-23,097 +1 83 249 22828 ACACCU (20) AUG 
M 23,090-23,746 +2 219 657 23072 ACACCA (17) AUG 
556 23,746–24,027 +1 94 282 23726 ACGCCA (19) AUG 
N 24,051-25,085 +3 345 1,035 24042 ACACCA (8) AUG 
NS7a 24,079-24,735 +1 219 657 24042 ACACCA (36) AUG 
NS7b 25,066-25,272 +1 69 207 25042 ACAACG (18) AUG 
NS7c 25,259-25,636 42 126 378 25244 AGACCU (14) AUG 
NS7d 25,629-25,826 +3 66 198 25618 AUACCA (10) AUG 
HouCoV UAE-HKU28 lab 588-19,264 — 43,42 6226 18,678 70 ACACCA (517) AUG 
S 19,246-22,848 +1 1201 3,603 19105 ACACCA (140) AUG 
E 22,842-23,090 +3 83 249 22821 ACACCU (20) AUG 
M 23,083-23,739 +1 219 657 23065 ACACCA (17) AUG 
NS6 23,739-24,020 +3 94 282 23719 ACGCCA (19) AUG 
N 24,044-25,078 +2 345 1,035 24035 ACACCA (8) AUG 
NS7a 24,072-24,728 43 219 657 24035 ACACCA (36) AUG 
Ns7b 25,059-25,265 +3 69 207 25040 ACAACG (18) AUG 
NS7c 25,252-25,629 +1 126 378 25237 AGACCU (14) AUG 
Ns7d 25,622-25,819 42 66 198 25611 AUACCA (10) AUG 
PiCoV UAE-HKU29 lab 588-19,264 43,2 6226 18,678 70 ACACCA (517) AUG 
S 19,246-22,848 +1 1201 3,603 19105 ACACCA (140) AUG 
> Е 22,842-23,090 +3 83 249 22821 ACACCU (20) AUG 
> M 23,083-23,739 +1 219 657 23065 ACACCA (17) AUG 
о NS6 23,739-24,020 +3 94 282 23719 АССССА (19) AUG 
> N 24,044-25,078 +2 345 1,035 24035 ACACCA (8) AUG 
5 NS7a 24,072-24,728 +3 219 657 24035 ACACCA (36) AUG 
-5 Ns7b 25,059-25,265 +3 69 207 25040 ACAACG (18) AUG 
с NS7c 25,252-25,629 +1 126 378 25237 AGACCU (14) AUG 
o Ns7d 25,622-25,819 42 66 198 25611 AUACCA (10) AUG 
ES 
QuaCoV UAE-HKU30 lab  522-19,3312 43,42 6121 18,363 62 ACACCA (459) AUG 
S 19,294—22,770 +1 1159 3,477 19153 АСАССА (140) АСС 
Е 22,764-23,015 +3 84 252 22740 СААССА (23) AUG 
M 23,008-23,661 +1 218 654 22987 ACACCA (20) AUG 
NS6 23,661–23,942 +3 94 282 23614 ACACCA (46) AUG 
N 23,967-24,902 +3 342 1,026 23959 ACACCA (7) AUG 
NS7a 24,061-24,663 +1 201 603 23978 GCTCCA (82) AUG 
NS7b 25,003-25,419 +1 139 417 24998 ACACCA (4) AUG 
М57с 25,358-25,579 +2 74 222 25347 ACACCA (10) AUG 
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791 Table 4. Putative cleavage sites at the junctions between non-structural proteins in FalCoV UAE-HKU27, HouCoV UAE-HKU28, PigCoV 
792 UAE-HKU29 and QuaCoV UAE-HKU30 compared with other deltacoronaviruses 














793 
mp uu m uos x xc е SS пи uus С Е Ч 
HKU27 HKU28 HKU29 HKU30 
nsp2/nsp3 AG/SD AG/SD AG/SD AG/AD VG/GL DG/VY AG/VS AG/SD AG/SD AG/SD AG/SD 
nsp3/nsp4 АС/АР AG/AR АС/АР АС/АМ TG/GN GG/SK АС/КЕ AG/RK AG/RK AG/RK АС/АР 
nsp4/nsp5 LQ/AG LQ/AG LQ/AG LQ/AG VQ/AG VQ/SG VQ/AG LQ/AG LQ/AG LQ/AG LQ/AG 
nsp5/nsp6 LQ/sG LQ/SN LQ/sG LQ/SG LQ/GT LQ/AN LQ/AS LQ/SG LQ/sG LQ/sG 10/88 
nsp6/nsp7 VQ/NK VQ/NK VQ/NK VQ/NK VQ/NK VQ/NR VQ/NR VQ/NR VQ/NR VQ/NR VQ/NK 
nsp7/nsp8 VQ/AV VQ/AV VQ/AV VQ/AV LQ/VV LQ/VV LQ/VV LQ/AV LQ/AV LQ/AV VQ/AV 
= nsp8/nsp9 LQ/NN LQ/NN LQ/NN LQ/NN LQ/NN CQ/NN LQ/NN LQ/NN LQ/NN LQ/NN LQ/NN 
ES nsp9/nsp10 LQ/AS LQ/AN LQ/AN LQ/AN LQ/SS LQ/AN LQ/AT LQ/AN LQ/AN LQ/AN LQ/AN 
= nsp10/nsp11_ ІО/М8 LQ/GS LQ/NS LQ/NS LQ/LG LQ/SN LQ/NT LQ/GS LQ/GS LQ/GS LQ/NS 
d nspl2/nspl3 — LQ/AS LQ/AS LQ/AS LQ/AS LQ/AT LQ/AT LQ/AS LQ/AS LQ/AS LQ/AS LQ/AS 
3 nspl3/nspla — LQ/SS LQ/SS LQ/SS LQ/AG VQ/SL VQ/AE VQ/CS LQ/SS LQ/SS LQ/SS LQ/SG 
с nspl4/nspl5 ІО/МІ, LQ/NL LQ/NL LQ/NL LQ/TL LQ/TL LQ/TI LQ/NL LQ/NL LQ/NL LQ/NL 
o nspl5/nspló — LQ/SL VQ/SL LQ/SL LQ/SL VQ/AL LQ/SL VQ/SL VQ/AL VQ/AL VQ/AL LQ/SL 
E" 794 
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795 Table 5A. Comparison of amino acid identities (%) of the seven conserved 
796 replicase domains for species demarcation among FalCoV UAE-HKU27, HouCoV 
797 | UAE-HKU?28, PiCoV UAE-HKU29, апа WECoV-HKU16 
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798 
CoV Domains WECoV FalCoV HouCoV PiCoV 
HKUI6 UAE-HKU27 ОАЕ-НКО28 UAE-HKU29 
WECoV ADRP - 68.4 68.7 68.8 
HKUI6 3CLP? - 86.6 86.3 87.0 
RdRp - 91.9 91.9 91.7 
Hel - 96.8 97.0 97.0 
ExoN - 93.6 93.8 93.8 
NendoU - 86.2 85.6 85.6 
O-MT - 89.2 89.2 88.9 
Concatenated — - 84.4 84.4 84.4 
FalCoV ADRP - - 98.6 98.6 
UAE-HKU27 3CLP° - - 98.7 98.7 
RdRp - - 98.9 98.7 
Hel - - 99.5 99.5 
ExoN - - 99.8 99.8 
NendoU - - 99.1 99.1 
O-MT - - 100 99.6 
= Concatenated — - - 99.] 99.] 
-9 
3 HouCoV ADRP - - - 99.9 
Es UAE-HKU28 ЗС - - - 99.3 
75 RdRp - - - 99.8 
3 Hel - - - 100 
ExoN - - - 100 
NendoU - - - 100 
O-MT - - - 99.6 
Concatenated - - - 99.9 
799 
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800 Table 5B. Comparison of amino acid identities (%) of the seven conserved 
801 replicase domains for species demarcation among QuaCoV UAE-HKU30, PorCoV- 
802 HKUI5, and SpCoV-HKU17 
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803 
CoV Domains PorCoV SpCoV QuaCoV 
HKUI5 HKUI7 UAE-HKU30 
PorCoV HKUI5 | ADRP - 89.3 83.0 
3CLP? - 97.1 91.2 
RdRp - 97.8 95.2 
Hel - 99.2 98.5 
ExoN - 97.3 96.1 
NendoU - 96.3 91.1 
O-MT - 97.5 96.4 
Concatenated - 95.0 91.3 
SpCoV HKUI7 АРВР - - 83.0 
3CL^? - - 93.8 
RdRp - - 95.8 
Hel - - 98.7 
ExoN - - 97.1 
NendoU - - 92.0 
O-MT - - 95.7 
> 
> Concatenated - - 91.8 
70 
= 804 
"o 
"o 
= 
= 
-9 
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